Directed Evolution of High-Affinity Antibody 
Mimics Using mRNA Display 



Lihui Xu, Patti Aha, Ke Gu, Robert G. Kuimelis, 

Markus Kurz, 2 Terence Lam, Ai Ching Lim, 

Hongxiang Liu, 3 Peter A. Lohse, 4 

Lin Sun, Shawn Weng, Richard W. Wagner, 

and Dasa Lipovsek 1 

Phylos, Inc. 

128 Spring Street 

Lexington, Massachusetts 02421 



Summary 

We constructed a library of >10 12 unique, covalently 
coupled mRNA-protein molecules by randomizing three 
exposed loops of an immunoglobulin-like protein, the 
tenth fibronectin type III domain ( 10 Fn3). The antibody 
mimics that bound TNF-a were isolated from the li- 
brary using mRNA display. Ten rounds of selection 
produced 10 Fn3 variants that bound TNF-a with disso- 
ciation constants (K d ) between 1 and 24 nM. After affin- 
ity maturation, the lowest K d measured was 20 pM. 
Selected antibody mimics were shown to capture TNF-a 
when immobilized in a protein microarray. 10 Fn3-based 
scaffold libraries and mRNA-display allow the isolation 
of high-affinity, specific antigen binding proteins; po- 
tential applications of such binding proteins include 
diagnostic protein microarrays and protein thera- 
peutics. 

Introduction 

Antibodies and other reagents that bind to their targets 
with high affinity and specificity are in increasing demand 
as the focus of genomic research expands to include gene 
products. We designed a library of antibody mimics 
based on the structure of the tenth fibronectin type III 
domain ( 10 Fn3), then used the recently-described 
method of mRNA display [1-8] to select specific affinity 
reagents of subnanomolar affinity. 

Like ribosome display [9, 10], RNA display does not 
require the transformation of library DNA into a microor- 
ganism. As a consequence, the size of the libraries made 
using these methods is not limited by the transformation 
efficiency, and libraries with 10 12 -10 13 different se- 
quences have been constructed [5, 9, 11]; in contrast, 
the largest phage library reported to date contains 10 11 
different sequences [12]. RNA display has the added 
advantage that the association between the mRNA 
strand and the polypeptide encoded by that mRNA is 
covalent, which makes possible the selection under 
stringent conditions, such as at elevated temperature. 
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Due to the high performance of the natural immune 
response, the first artificial affinity reagents were based 
on antibodies and on antibody fragments. The now rou- 
tine production of polyclonal and monoclonal [13] anti- 
bodies in vivo exploits an animal immune system to 
generate a diverse pool of antibodies and to select those 
antibodies that bind to the target of interest. In contrast, 
an in vitro selection of antibodies requires the creation 
of DNA libraries by cloning the natural immune repertoire 
of naive or of preimmunized animals [14, 15] or by ran- 
domizing the positions in cloned antibody genes that 
encode the complementarity-determining regions [16, 
17]; target binding antibodies are then selected from 
such libraries under biochemically defined conditions. 
Due to the challenges associated with the manipulation 
of proteins that are the size and the complexity of anti- 
bodies, most of the in vitro development of antibody- 
based affinity reagents has focused on antibody fragments 
such as F ab fragments [18-21], single-chain antibodies 
[11, 14-16, 22, 23], helix-stabilized antibody fragments 
[24], and isolated V H [25-33] and V L domains [34-36]. 
In addition, some specific affinity reagents have been 
selected from peptides [37-42] and from small, globular 
protein scaffolds not related to antibodies [43-53]. Over- 
all, the scaffolds related to natural antibodies have 
yielded molecules of higher affinity than did the short 
peptides and the protein scaffolds not related to anti- 
bodies. 

We chose the tenth type III domain of human fibronec- 
tin fFnS), a 94 residue (10 kDa) structural protein [54, 
55] with an immunoglobulin-like fold (Figure 1) [56-59] as 
the antibody-mimic scaffold. The properties that make 
10 Fn3 a good scaffold candidate include exceptional 
thermostability (T m = 90°C) and solubility (>15 mg/ml), 
high expression level in E. coli (50 mg/l), and the lack 
of disulfide bonds or of unpaired cysteines in its struc- 
ture [60]. The stability of the wild-type 10 Fn3 can be 
improved further by the removal of unfavorable surface 
electrostatic interactions [61]. Three of the solvent- 
accessible loops in 10 Fn3, named BC (residues 21-30), 
DE (residues 51 -56), and FG (residues 76-87), are struc- 
turally analogous to the V H complementarity-determin- 
ing regions (CDR) H1 , H2, and H3, respectively (Figure 
1). The fact that the sequences of loops BC, DE, and 
FG are less conserved between different Fn3 domains 
than is their |J sheet framework [57] suggests that a 
diversity artificially introduced into these three loops 
may not disturb the overall fold of the domain. 

The first report on the use of the 10 Fn3 scaffold as an 
antibody mimic was published by Koide et al. [62]. In 
that study, a library of 10 8 distinct, 10 Fn3-like sequences 
was made by deleting three residues (82-84) in the FG 
loop and by randomizing five residues (26-30) in the 
BC loop and four residues (77-80) in the FG loop. A 
subsequent phage display selection against immobi- 
lized ubiquitin yielded a clone that bound to the target 
with the IC 50 of 5 U.M, as determined by competition- 
phage ELISA. The selected ubiquitin binding 10 Fn3-like 
domain showed "broad binding specificity" [62], includ- 
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Figure 1. Comparison of the Primary Se- 
quences and of the Tertiary Structures of a 
Llama V HH Domain and the Wild-Type Human 
1 °Fn3 Domain 

A comparison of the primary sequences and 
of the tertiary structures of a llama V HH domain 
[70], which is the smallest antibody fragment 
known to bind to an antigen, and the wild- 
type, human 10 Fn3 domain [56], the scaffold 
for a new class of antibody mimics. Alignment 
of primary sequences (A) and structural com- 
parison (B) between these two domains dem- 
onstrates that, despite the lack of significant 
sequence identity, the V HH and the 10 Fn3 fold 
into similar (3 sheet sandwiches. The disulfide 
bond between Cys 22 and Cys 92 of the V HH 
domain is shown in yellow; there are no disul- 
fides in the 1 °Fn3 domain. The complementar- 
ity-determining regions of the V^h and the res- 



quence). Blue: CDR-H1 of the llama V„„ and 
residues 23-29 of the 10 Fn3 BC loop; green: 
CDR-H2 of the llama V„„ and residues 52-55 
of the 1 °Fn3DE loop; red: CDR-H3 of the llama 
V„„ and residues 78-87 of the 1 °Fn3 FG loop. 
In the sequence alignment, the homologous 
residues are boxed. The 1 °Fn3 residues out- 
side the randomized loops that were found 
to have mutated in approximately 45% of the 
selected clones are marked in black in the 
ribbon representation of the 1 °Fn3 st 



Vhh 



iOFn3 



ing binding todextran, and dramatically lowered stability 
and solubility when compared to the wild-type. This 
study demonstrated the potential utility of the 10 Fn3 scaf- 
fold, but it left open the question of whether the 10 Fn3 
structure was robust enough to tolerate the extensive 
sequence variation required to generate new binding 
properties [63]. 

Here we show that the combination of an alternative 
library design and of the alternative selection method 
of mRNA display, which allowed the sampling of a larger 
library, yielded Fn3-based antibody mimics that bound 
to their target with high specificity and with dissociation 
constants (K d ) in the pico- to nanomolar range. Further- 
more, the selected antibody mimics retained their bind- 
ing properties when immobilized to create a protein mi- 
croarray. 

Results and Discussion 
Library Construction 

The 10 Fn3-based library was an equimolar mixture of 
three sublibraries, the first with one randomized loop, 
FG (ten residues), the second with two randomized 
loops, BC (seven residues) and FG, and the third with 
three randomized loops, BC, DE (four residues), and FG. 



In all, 21 residues were randomized in the library with 
three randomized loops (Figure 1). 

The mixture of the three DNA libraries was transcribed 
and translated into an RNA-protein fusion library [5], which 
consisted of 1 0 12 members with unique sequences. 

Library Diversity 

Theoretically, the number of unique amino acid se- 
quences that could appear in the three sublibraries is 
1013 (20 10 ) for the sublibrary with one randomized loop, 
10 22 (20 17 ) for the sublibrary with two randomized loops, 
and 10 27 (20 21 ) for the sublibrary with three randomized 
loops. It is important to keep in mind that even our library 
of 10 12 different molecules sampled only a fraction of 
such possible sequences. 

Another challenge is to translate the sequence diver- 
sity of a synthesized DNA library into the functional di- 
versity of the protein library it encodes. Of the 32 codons 
encoded by the sequence NNS (NNC or NNG), only one 
is a stop codon. Nevertheless, only 51% ((31 /32) 21 ) of 
10 Fn3-like sequences with 21 NNS-encoded positions 
would be expected to be free of stop codons; the re- 
sulting protein diversity would be about half of the origi- 
nal DNA diversity. 

To avoid this problem, we synthesized each DNA li- 



Directed Evolution of Antibody Mimics 
935 





■R10 


rb r 





0 TNF-a IL-1 IL-13 



Figure 2. Specificity of Binding of the 1 °Fn3 Pool Translated from 
the Original, Randomized Library and from the Pool Obtained after 
Ten Rounds of Selection for Binding to TNF-a 
The library or the PCR product of the elution after R10 was tran- 
scribed and translated in vitro, in the presence of ^S-methionine 
but without forming RNA-protein fusion. The fraction of the resulting 
free protein bound to underivatized Sepharose, to TNF-a-Sepharose 
(~10 (iM TNF-a in the target- 10 Fn3 binding mixture), to IL- 
1a-Sepharose (~30 (iM IL-1 a), and to IL-13-Sepharose (~50 (iM IL- 
13) is compared. The R10 selected protein pool binds specifically 
to TNF-a-Sepharose, whereas the RO library binds at a comparable, 
low background level to all three immobilized cytokines. 

brary in three fragments, each with one randomized 
region and encoding a C-terminal FLAG peptide. The 
DNA fragments were transcribed and translated into 
RNA-polypeptide fusions and affinity purified to recover 
only the fusions that expressed the FLAG peptide, i.e., 
that had no internal stop codons. Once the DNA frag- 
ments encoding the full-length peptides were assem- 
bled into the 94 residue 10 Fn3-like domain, the sequence 
diversity of the assembled sublibraries was determined 
by the amount of RNA-protein fusion made in vitro, in 
this case, 2 pmol, which corresponds to 10 12 full-length 
molecules. The sequencing of 75 full-length clones (data 
not shown) from the sublibrary with three randomized 
loops, before the selection began, confirmed the ab- 
sence of stop codons. 

In Vitro Selection on TNF-a-Sepharose 

Significant binding of the RNA-protein fusion pool to 
TNF-a-Sepharose was detected after nine successive 
rounds of selection and amplification (R9), when 0.7% 
of the fusion pool bound to the immobilized target. After 
an additional round (R10), 7% of the selected RNA- 
protein fusion pool bound to TNF-a-Sepharose, whereas 
the binding of the R1 0 fusion pool to underivatized Seph- 
arose remained under the detection limit of 0.25%. A 
comparison in binding efficiency of the free protein en- 
coded by the R1 0 pool to TNF-a and to two other cyto- 
kines, IL-1 a and IL-13, demonstrated specific binding 
to TNF-a (Figure 2). 



Amino Acid Sequences of Variants Selected 
on TNF-a-Sepharose 

The selected pool of TNF-a binding 10 Fn3 variants was 
found to contain clones with diverse sequences. Thirty- 
two clones from the R9 pool and 29 clones from the 
R1 0 pool were picked at random and sequenced. Thirty- 
eight of these 61 clones (62%) had unique amino acid 
sequences and illustrated the remarkable diversity of 
the target binding 10 Fn3-like domains isolated (Table 1 A). 
The fact that 90% of the selected clones originated in 
the library with all three CDR-like loops randomized sug- 
gests that the target may be bound cooperatively by 
several of these loops. We determined the dissociation 
constants from TNF-a for all the clones that were repre- 
sented more than once in R9 and R1 0 pools, and we 
found K d values in the range of 1-24 nM (Table 1A). 

The amino acid sequences of the selected clones 
revealed a number of conserved patterns (Table 1A). 
The most common motif found in the selected loops 
was PWA(S/T), which was seen in the DE loop of 54% 
of the clones; the more loosely defined sequence of 
PW(A/G) was seen in 67% of the clones. Approximately 
one-half of the selected clones showed conserved 
changes in the 3 sheet core of the protein, which had not 
been randomized intentionally (Figure 3A); presumably, 
those changes were due to the mutagenesis by Taq 
polymerase during PCR amplification or by reverse tran- 
scriptase during the synthesis of RNA-protein fusion. 
Further studies of the recurrent scaffold mutations will 
determine if these mutations improve stability or ligand 
binding of the selected 10 Fn3-like domains. 



In Vitro Selection in Solution and Affinity Maturation 

Due to the diversity of the selected R1 0 pool, four addi- 
tional rounds of selection (R11-R14) were conducted 
under increased selection pressure in order to isolate 
the highest-affinity molecules in the pool. The selection 
rounds R11-R14 were performed in solution, using bio- 
tinylated TNF-a at the concentration of 0.5 nM, which 
was approximately 1 0Mold lower than the concentration 
of TNF-a-Sepharose during R1-R10. In addition, the se- 
lection was performed at an elevated temperature (30°C 
versus 4°C) and with extended washes of the bound 
complexes. Twenty-two clones from the R1 4 pool were 
chosen at random and sequenced and were found to 
represent 15 different loop sequences (Table 1 B). The 
lowest K d values measured for these sequences, 90 pM 
(for clone T14.25) and 110 pM (for clone T14.20) were 
an order of magnitude lower than those observed in the 
R9 and R1 0 pools. A parallel strategy incorporated error- 
prone PCR into the amplification of DNA between 
rounds of selection [64]. The diverse DNA pool from R8 
of the TNF-a-Sepharose selection was subjected to two 
rounds of selection in solution with mutagenesis at the 
frequency of 0.8%-4.8% (M9) and of 0.8%-2.4% (M10), 
then to two more rounds of selection without the muta- 
genesis (M11-M12). The 20 M12 clones tested showed 
tighter binding to TNF-a than had the clones selected 
using the two earlier selection protocols (Table 1 C); the 
tightest binding of TNF-a was seen in M12.04, which 
had the observed K d of 20 pM. The M12.04 sequence 
contains a 6 residue deletion in the FG loop as well as 
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Table 1 . Loop Sequences and Binding Properties of 10 Fn3-Derived Protein Pools and Variants 



A R09 pool' 
R1 0 pool' 
T09.07 
T10.06 
T09.12 
10.15 
T09.05 
T09.03 
T10.33 
T09.34 
T09.14 
T09.33 



T14.25 
T14.20 
T14.24 
T14.22 
T14.10 
T14.02 
T14.06 
T10.06 
T14.21 
T09.34 
T14.03 
T14.01 
T14.11 



xxxxxxx 
xxxxxxx 

ASPPMWC 
NRSGLQS 
RPTSNPP 
HTERSFP 
TTRHSPV 
RPNPRLS 
SPPNDAH 
RTPASPH 
YRHTYRD 
NRSGLQS 



DTHNAYN 

RNIYPIA 

NRSGLQC 

RPGRTYS 

RPGRTYS 

MTPHNHV 

TRTNAST 

NRSGLQS 

RPGRTYS 

RTPASPH 

TRTNANT 

DNSRPNT 

NPNRSFA 



xxxx 
xxxx 

PWAT 
PWAS 
PWAS 
PWGS 
PWAT 
GLFS 
GSKS 
LLWP 
PWAT 
PWAS 

HPEV 
PWAS 
PWAS 
PWAN 
PWAN 
TGNA 
NFWW 
PWAS 
PWAN 
LLWP 
NNPP 
PWGS 
PWAS 



xxxxxxxxxx 
xxxxxxxxxx 

EYLPEWNMTQ 

DKSDTYKYDD 

AQTGHHLHDK 

EHYRDTGTGH 

MPTNWRFPHR 

PKETSNIFIA 

DQQSYTYYSN 

PTHMLKPQSM 

DTGYDVHTKR 

SNVGRLDTRY 



NHHMPLRIFG 
DKSDTYKYDD 
DQRDTYKYDD 
FPTGYPLTEM 
FPPGYPLTEM 
PHHGHFDLEP 
SPDETSAYSE 
DKSDTYKYDD 
YTHSTPMQDE 
PTHMLKPQSM 
PDGSRHMLTK 
TSECHKLSST 
AQTGHHLHDK 



17 ± 6 
13 ± 6 

17 ± 1 

18 ± 2 
24 ± 2 



CM12pool c 
M12.04 
T14.25 
M12.18 
M12.26 
M12.12 
M12.13 
M12.16 
M12.21 
M12.23 
M12.09 
M12.05 
M12.08 
M12.19 
M12.22 
M12.24 
M12.01 
M12.14 
M12.07 
M12.25 
M12.15 



XXXXXXX 
SMTPNWP 
DTHNAYN 
ASPPMW 
RPPADLN 
EQSPTYG 
RPGRTYS 
RPGRTYS 
YRHTYRD 
RTMPVTA 
RPGRAYS 
SPPNDAH 

RNNQYTP 
RPASNPA 
NRSGLQS 
NRSGLQS 
THDNVPA 
RSGNRTT 
NRSGLQS 
RNAKDPG 



XXXX 
PWAS 
HPEV 
PWAT 
PWGT 
GSKS 
PWAN 
PWAS 
PWAT 
PWAS 
PWAN 
GSKS 
APKA 
ELNP 
PWAS 
PWAS 
PWAS 
PWAS 
PWAT 
PWAS 
PWGT 



XXXXXXXXXX 
HRDT 

IHHMPLRIFG 

DESWSDRSMD 

HRDT 

IEKDRIPLFG 

FPPGYPLTEM 

DKSGTYRYDD 

DAGYDVHTKR 

SATPSRPNVH 

FPPRYPLTEM 

DQQSYTYYSN 

SHRNHFHVET 

QNGTPRVIYG 

AQTGHRLHDK 

PNVGRLDTRY 

DESDTYKYDD 

LYTGNHRPEH 

THNSTAQPEY 

SNVGRLDTRY 

ATNPGPTQHR 



0.4 ± 0.1 
0.02 ± 0.01 
0.09 ± 0.02 



1.6 ± 0.2 

1.7 ± 0.4 

4± 2 
>1000 



The first two digits of the name of the clone show the round of selection from which the clone was derived. "Number" indicates the number 
of clones with the amino acid residue sequence in loops BC, DE, and FG that is identical to the clone listed. Since no systematic difference 
was observed between the R9 and R10 pools, the entry in Table 1A is the combined number of clones isolated from these two pools. The 
amino acid residue sequence of loop BC, DE, or FG in the specified clone is shown. Dissociation constant (KJ for the binding of the free 
protein form of the specific 10 Fn3 clone to trimeric TNF-a, at room temperature, is shown. The values reported and the associated errors were 
derived from 2-4 independent measurements. PPT indicates protein sample was not soluble under conditions used for Ka determination. 
'After nine (R9) and after ten (R10) rounds of selection using TNF-a-Sepharose. 
"After four rounds of selection starting with the R10 pool and using biotin-TNF-a (R14). 

c After two selection rounds with error-prone PCR, followed by two selection rounds with standard Taq PCR, starting with the R8 pool and 
using biotin-TNF-a (M12). Wild-type 10 Fn3 sequences are bold. 



the frequently selected PWAS tetrapeptide in the DE 
loop. Two other high-affinity TNF-a binding variants that 
evolved using error-prone PCR, M12.26 and M12.08, 
resulted from major deletions in the randomized loops 
(six residues in the FG loop and five residues in the BC 
loop, respectively); in contrast, no deletions were found 



in any of the characterized variants derived without error- 
prone PCR. These results demonstrate that low-level, ran- 
dom mutagenesis late in a selection can improve both 
the binding affinity of selected antibody mimics (20 pM 
versus 90 pM) and the speed with which they can be 
selected (12 rounds versus 14 rounds). 
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Figure 3. Alignment of the Amino Acid Residue Sequences of the Wild-Type 1 °Fn3 and of the Selected Fn3 Variants 

(A) After nine (R9) and after ten (R10) rounds of selection using TNF-a-Sepharose. 

(B) After four rounds of selection starting with the R10 pool and using biotin-TNF-a (R14). 

(C) After two selection rounds with error-prone PCR, followed by two selection rounds with standard Taq PCR, starting with the R8 pool and 
using biotin-TNF-a (M12). 

The residues identical to those of wild-type 1 °Fn3 are represented by dots, deletions are represented by dashes, and the amino acid residues 
different from the corresponding wild-type residues are identified by one-letter code. The variable-loop sequences that appeared in several 
different clones are highlighted, as are the changes at the two most variable positions outside the randomized loops, positions #1 8 and #56. 



Biophysical State of Fn3-Based Antibody Mimics 
To confirm that the selected antibody mimics fold into a 
three-dimensional structure, the wild-type human 10 Fn3 
and two representative variants, T09.07 and M12.21, 
were expressed in £. co/i, purified using the hexahisti- 
dine affinity tag, and incubated with thermolysin at tem- 
peratures between 4°C and 50°C. As expected for a 
highly thermostable protein, wild-type 10 Fn3 resisted 
proteolysis throughout the temperature range tested, 
suggesting that the sample was completely folded under 
those conditions. In contrast, a fraction of the protein 
in T09.07 and M1 2.21 samples was found to be sensitive 
to proteolysis at 37°C and 25°C, respectively (Figure 4). 
Approximately 50% of the T09.07 domain resisted the 
protease after a 1 5 min incubation of 42°C, and approxi- 
mately 40% of M12.21 resisted the protease after such 
an incubation at 30°C. This finding demonstrates that, 
whereas the two variants are less stable than the wild- 
type 10 Fn3, they are predominantly folded at room tem- 



perature, the temperature at which their binding was 
characterized; in other words, the selected loop se- 
quences bind their targets in the context of a folded 
domain. 

In addition, analytical gel filtration showed that the 
apparent molecular weight of purified, soluble wild-type 
10 Fn3, T09.07, and M12.21 at 0.1-1 mg/ml is consistent 
with the variants being monomeric. Consequently, the 
Fn3 samples used to determine their K d , which were 
three orders of magnitude more dilute, were likely to 
interact with TNF-a as monomers. 

Fn3-Based Antibody Mimics in Protein Microarrays 

To demonstrate the utility and the specificity of 10 Fn3- 
based antibody mimics, we tested their activity when 
immobilized onto a solid surface. In these studies, we 
included a 10 Fn3 variant that binds to leptin, clone L9.20, 
which had been selected from the same 10 Fn3-based 
library and under similar selection conditions as the 
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Figure 4. Sensitivity of Selected Domains to Thermolysin 
Sensitivity to thermolysin with increasing temperature for (A) wild- 
type, human 10 Fn3, (B) T09.07, (C) variant M1 2.21, and (D) unfolded 
control (ras-derived peptide [1-189] fused to GST [GST-ras]). Lane 
1 : molecular weight standards, labeled in kDa. Lanes 2-9: samples 
after 15 min incubation at increasing temperature, with thermolysin 
at 20 ng/ml (T) or without thermolysin (0). Lane 2, 4°C, 0; lane 3, 
25°C, 0; lane 4, 4°C, T; lane 5, 25°C, T; lane 6, 30°C, T; lane 7, 37°C, 
T; lane 8, 42°C, T; lane 9, 50°C, T. The arrow on the right points at 
the position of full-length Fn3 or full-length GST-ras. No proteolysis 
of the wild-type was observed between 4°C and 50 C, but the se- 
lected variants were degraded, with about a half of the full-length 
domain gone at 42°C for T09.07 and at 30°C for M12.21. Most of 
the control protein, the fusion of glutathion-S-transferase with an 
unstructured, ras-derived peptide (GST-ras), was degraded already 
when exposed to thermolysin at 4°C. 

TNF-a binding variants (L.S., A. Kovtun, Y. Chen, H. Gao, 
P. Amersdorfer, B. Kreider, and R.W.W., unpublished 
data) and which had the K d for leptin of 0.8 nM. 

To generate the microarrays, a Watson-Crick immobi- 
lization scheme was used to attach the 10 Fn3 variants 
to a glass surface [65]. In this method, mRNA-protein 
fusions were prepared using DNA linkers with se- 
quences that were clone specific. This allowed the anti- 
body-mimic fusions to self -assemble onto an array spot- 
ted with the capture probe complementary to their 
respective DNA linkers. Following RNase A digestion of 
the RNA portion of the fusion, the resulting DNA-tagged 
10 Fn3 variants were allowed to hybridize to the array, 
thereby immobilizing the antibody mimics via their car- 
boxyl termini and exposing their target binding loops to 
solution. 

In the first test, the binding properties of wild-type 
10 Fn3, of T09.12 (a 10 Fn3 variant selected for binding to 
TNF-a), and of L09.20 (a 10 Fn3 variant selected for bind- 
ing to leptin) were compared when these three domains 
were immobilized on a microarray. Each of the antibody 



A B 




Figure 5. Specific Capture of TNF-a and of Leptin by an Antibody 
Mimic Immobilized on a Solid Surface 

Two identical microarrays were constructed by immobilizing DNA- 
puromycin-Fn3 constructs on a glass surface that contained a 
TNF-a binding Fn3 variant, T09.12 (T). a leptin binding Fn3 variant, 
L09.20 (L), and wild-type human 10 Fn3 (W). Capture oligonucleotides 
complementary to the DNA linker attached to the Fn3 protein were 
used to direct the constructs to specific locations. The fourth set 
of features (C) was created by spotting onto the chip an oligonucleo- 
tide not complementary to any of the DNA sequences present in 
the fusions. To examine the binding properties of the immobilized 
Fn3, the first fusion microarray (A) was exposed to 1 00 nM biotinyl- 
ated TNF-a, and the second microarray (B) to 100 nM biotinylated 
leptin. Next, bound target was visualized using a Cy3-labeled anti- 
biotin monoclonal antibody. 



mimics was prepared as an 35 S-labeled RNA-protein fu- 
sion, digested with RNase, and allowed to self-assemble 
into the predetermined position on a glass slide. The 
immobilization onto the array was confirmed by radioac- 
tive imaging of the 35 S-labeled, protein portion of the 
fusion (data not shown). After an incubation with the 
target proteins, T09.12 and L09.20 were found to bind 
their respective targets, but not to bind each other's 
targets. As expected, the wild-type 10 Fn3 bound neither 
TNF-ct nor leptin (Figure 5). 



1500 




15 20 25 30 35 
m/z (kDa) 

Figure 6. Specific Capture of TNF-a from a Biological Fluid by an 
Antibody Mimic Immobilized on a Solid Surface 
DNA-puromycin-Fn3 constructs of the T09.1 2 variant (A) and of the 
wild-type 1 °Fn3 (B) were hybridized to a glass cover slip arrayed 
with an immobilized oligonucleotide complementary to the DNA 
linker. After washing, the surfaces were exposed to TNF-a (1 .5 mg/ 
ml, in 90% v/v PBS, with or without 10% fetal bovine serum), the 
dried chip was spotted with MALDI matrix (200 |j,m feature size), 
and a single feature was analyzed with a Voyager DE MALDI-TOF 
mass spectrometer (PerSeptive Biosystems). A signal at 17.4 kDa, 
which corresponds to the expected molecular mass of monomeric 
TNF-a, was detected on the feature that contained T09.12 protein 
(A), but not on the feature that contained wild-type 10 Fn3 (B). This 
demonstrates that the Fn3 variant selected for the binding of TNF-a 
can capture its target specifically from the background of 1 0% FBS. 
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In the second test, the TNF-a binding antibody mimic 
T09.1 2 was immobilized in the same fashion onto a glass 
slide coated with its capture oligonucleotide. Following 
an incubation with TNF-a and subsequent printing with 
matrix material, the features were analyzed for the pres- 
ence of TNF-a using mass spectroscopy. T09.12 was 
shown to capture TNF-a when this target protein was 
spiked into either PBS buffer or into PBS with 10% 
fetal bovine serum (FBS); no other predominant proteins, 
such as those present in bovine serum, were detected 
(Figure 6). 

Together, the results of these two experiments dem- 
onstrate that the inherent specificity of the 10 Fn3-based 
antibody mimics was preserved when these mimics 
were immobilized on a solid surface. 

Conclusion 

The large library size made possible by the in vitro mRNA 
display technology allowed us to randomize all three 
CDR-like loops of 10 Fn3 and to sample 10" -fold more 
sequences than was possible in the phage-display se- 
lection reported by Koide et al. [62]. Most likely, these 
improvements in library construction and in selection 
methodology were the major factors that contributed to 
the following improvements in the 10 Fn3-based antibody 
mimics: (1) the higher affinity (20 pM [K d ] versus 5 |_iM 
[IC 5() ] [62]), (2) the increased specificity, and (3) the larger 
population of variants with comparable binding proper- 
ties (100 versus 1 [62]). 

This work introduces the 10 Fn3 scaffold as a viable 
alternative both to antibody fragments and to single- 
chain antibodies in the development of high-affinity, 
specific binding reagents. For example, two recent sin- 
gle-chain antibody selections, which used the Human 
Combinatorial Antibody Library and ribosome display, 
yielded antibody fragments against bovine insulin [11] 
and against guanine-quadruplex DNA [66] with K d as 
low as 80 pM and 1 25 pM, respectively, affinities close to 
that described in this paper. Furthermore, 10 Fn3-based 
antibody mimics may be preferable to antibodies in ap- 
plications that favor small domain size or the absence 
of disulfide bridges. 

One potential application of 10 Fn3-based antibody 
mimics is in diagnostic protein microarrays [67-69], 
where a large series of specific, 10 Fn3-based antibody 
mimics will be immobilized on a solid surface. Antibody 
mimics on such a protein microarray could be used to 
capture, and thus to identify and quantify, their respec- 
tive antigens from complex biological fluids. In an effort 
to develop a protein capture array, we have selected 
10 Fn3-based antibody mimics against a range of cyto- 
kines and cytokine receptors with affinity and specificity 
similar to that described here (L.S., A. Kovtun, Y. Chen, 
H. Gao, P. Amersdorfer, B. Kreider, and R.W.W., unpub- 
lished data). The ability of two such antibody mimics 
against different cytokines to capture their targets spe- 
cifically when immobilized on a glass slide (Figure 5) 
increases our confidence in the feasibility of diagnostic 
protein arrays based on 10 Fn3-like antibody mimics. 

Significance 

This work introduces the 10 Fn3 scaffold, a 94 residue 
domain with an immunoglobulin-like fold as an alter- 



native to antibody fragments and to single-chain anti- 
bodies in the development of subnanomolar, specific 
binding reagents. The 10 Fn3-based antibody mimics 
may offer advantages over antibodies for applications 
that favor their small size, their stability under reducing 
conditions, and their ease of production in E. coli. The 
mRNA display technology used here to select the 10 Fn3 
variants that bind TNF-a with 20 pM-24 nM affinity 
allows the creation of libraries of over 10 12 different 
sequences, which increases the likelihood that do- 
mains with desired binding properties will be found. 
Future libraries based on different types of p-sandwich 
scaffolds will allow the selection of antibody mimics 
with the wide range of properties required in research, 
diagnostic, and therapeutic applications. 

Experimental Procedures 
Library Construction 

Each library was constructed from three segments prepared either 
by PCR of the 1 °Fn3 clone (for wild-type segments), or by DNA 
synthesis (for segments comprising random sequences; Oligos, 
Etc.). Each segment that included a randomized loop was made 
from an overlapping pair of oligonucleotides, one of which contained 
the randomized region. The randomized positions were encoded by 
NNS, where N represents an equimolar mixture of all four nucleo- 
tides and S represents the equimolar mixture of G and C. For ran- 
domized BC, DE, or FG loops, positions encoding amino acid resi- 
dues 23-29, 52-55, or 77-86, respectively, were substituted with 
NNS. In addition to the 1 °Fn3-based sequences, these gene frag- 
ments contained 5' Tobacco Mosaic Virus (TMV) untranslated region 
and T7 promoter, as well as the sequences encoding N-terminal 
hexahistidine and C-terminal FLAG-epitope purification tags [5], and 
5' and 3' Earl restriction endonuclease recognition sites. Each pair 
of oligonucleotides was annealed and extended using Klenow (NEB) 
into a double-stranded DNA fragment, which was transcribed in vitro 
(T7-MEGAshortscript™, Ambion). The resulting mRNA was ligated 
to the puromycin-containing DNA-polyethylene glycol (PEG) linker 
(5'dA 1B PEG2dCdCPur) using DNA ligase (Promega) and a DNA splint 
oligonucleotide. The in vitro translation of the ligated mRNA in rabbit 
reticulocyte lysate (Ambion), in the presence of ^S-methionine, re- 
sulted in an mRNA-peptide fusion. The fusion was purified by Oli- 
go(dT)-Cellulose (Type 7, Amersham Pharmacia) and reverse tran- 
scribed (Superscript™ II, GIBCO) to generate a DNA strand 
complementary to the fusion mRNA. Full-length DNA/mRNA-pep- 
tide molecules were purified by affinity chromatography first on Ni- 
NTA agarose (Qiagen) and then on M2 Anti-Flag® Agarose (Sigma). 
The genetic information encoding each purified fragment was ampli- 
fied by PCR, which removed the His 6 tag on the N terminus of the 
molecule, then cleaved by Earl (NEB) and ligated to the appropriate 
randomized or wild-type fragments of DNA to make the three librar- 
ies. Finally, 2 pmol of each DNA library were combined to prepare 
the 1 °Fn3-based, master library. The DNA master library was pro- 
cessed into a DNA/mRNA-protein library following the procedure 
described for the individual fragments; no affinity purification was 
used in rounds one and two, whereas purification on M2-agarose 
was used in rounds three to ten. Forty picomoles of DNA/mRNA- 
protein fusion library molecules, the equivalent of 20 copies of 4 X 
1 0 12 different sequences, were recovered, then subjected to the first 
round (R1) of the selection. 

In Vitro Selection on TNF-a-Sepharose 

TNF-a (PeproTech) was immobilized on Epoxy-Activated Seph- 
arose™ 6B (Amersham Pharmacia) and blocked in Binding Buffer 
(R1-R6: 50 mM HEPES [pH 7.4], 0.02% Triton, 0.1 mg/ml sheared 
salmon sperm DNA [Ambion]; R7-R10: as in R1-R6, with additional 
1 mg/ml BSA [Sigma]). 

In the first step of a selection, the DNA/mRNA-protein library (R1 : 
40 pmol, R2: 24 pmol, R3-R10: 0.1-2 pmol) was precleared with 
Epoxy-Sepharose that had been subjected to the derivatization pro- 
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cedure in the absence of TNF-ct. The supernatant was incubated 
with TNF-ct-Sepharose, which was then washed 3x with binding 
buffer, eluted with 0.1 M KOH, and neutralized. The next round of 
selection began with PCR amplification of the eluted DNA, followed 
by transcription, translation, and reverse-transcription into an en- 
riched, DNA/mRNA-protein pool. 

In Vitro Selection in Solution 

Precleared, 0.1 nM DNA/mRNA- 10 Fn3 fusion library was incubated 
with 0.5 nM biotinylated TNF-a in binding buffer (50 mM HEPES [pH 
7.4], 0.02% Triton, 0.1 mg/ml sheared salmon sperm DNA [Ambion]; 
1 mg/ml BSA [Sigma]), then the complex was captured with Dyna- 
beads® M-280 Streptavidin paramagnetic beads (Dynal) that had 
been preblocked in the binding buffer. The beads were washed with 
binding buffer for 1,15, and 30 min in the case of R11 and R12, or 
for 1 min, followed by nine 10 min washes, in the case of R13-R14. 
DNA was eluted with KOH and used to make the next generation 
DNA/mRNA- 10 Fn3 fusion library. Round 14 differed from R11-R13 
in that the selection was performed at 30°C rather than at 4°C and 
in the presence of additional 1 50 mM NaCI. 

The DNA pool eluted after R8 was amplified using error-prone 
PCR, with the pool divided into seven equal parts and mutagenized 
at the target frequency of 0.8%, 1 .6%, 2.4%, 3.2%, 4.0%, and 4.8% 
using the Diversify™ PCR Random Mutagenesis Kit (Clontech). The 
seven PCR reactions were combined, and DNA/RNA-protein fusion 
was made from the mixture and subjected to a round of selection 
in solution. Before the second mutagenic round, M10, error-prone 
PCR was performed in three separate reactions, at 0.8%, 1 .6%, and 
2.4%. The two remaining rounds, M11 and M12, were performed 
using standard Taq PCR. Except for the mutagenesis, the selection 
conditions for M9-M12 were the same as for R11-R14. 

The leptin binding Fn3 variant L09.20, which was used to examine 
the specificity of TNF-n binding variants in protein microarrays, was 
selected in solution, using biotinylated leptin and a protocol similar 
to the one described above. No error-prone PCR was used in the 
leptin selection, and the concentration of biotinylated leptin was 
decreased gradually from 250 nM in R1 to 14 nM in R9 (L.S., A. 
Kovtun, Y. Chen, H. Gao, P. Amersdorfer, B. Kreider, and R.W.W., 
unpublished data). 

Determination of Dissociation Constants 

For R9, R1 0, R1 4, and M1 2 pools, as well as for each characterized 
clone derived from these pools, 11 samples of 0.25 nM, in vitro- 
translated, ^S-methionine-labeled protein were incubated with bio- 
tinylated TNF-a at concentrations between 17 pM and 23 nM, in 
200 jjlI 10 mM HEPES (pH 7.4), 150 mM NaCI, 1% BSA, 0.02% 
Triton, for 1 hr at room temperature. Subsequently, each sample 
was loaded on a presoaked, SAM2R Biotin Capture Membrane (Pro- 
mega) using a 96-well, Easy-Titer™ ELI FA system (Pierce). Under 
vacuum, each spot was washed with 200 |jJ of H BS (pH 7.4), 1 % BSA 
0.05% Triton; next, the entire membrane was rinsed in the same buf- 
fer and air-dried. The membrane was exposed with a Storage Phos- 
phor Screen (Molecular Dynamics) overnight, and the intensities of 
the resulting individual spots were quantified using a STORM 860 
phosphorimager with the ImageQuaNT densitometry program 
(Molecular Dynamics). The Ka of the complex was determined by 
fitting the data to the unsimplified equilibrium binding equation S = 
S m „ / B t ([B t + T, + KJ / 2 - V{[(B, + T t + KJ / 2] 2 - B.TJ + NS), 
where S is ^S signal, B, is the input concentration of the target 
binding Fn3 domain, T, is the input concentration of the target, Ka 
is the dissociation constant, and NS is the signal from nonspecific 
background binding. The program KaleidaGraph (Synergy Software) 
was used to fit the data to the equation and to calculate K,, values. 

Expression and Biophysical State of Fn3-Based 
Antibody Mimics 

Antibody mimics were expressed in E. coli BL21(DE3) pLysS (In- 
vitrogen) from pET-based vectors (Novagen; pET25 for T09.07, 
pET9a for M12.21 and for the wild-type). The sequences cloned 
onto the C terminus of the clones for detection, purification, and 
attachment to the solid phase were HHHHHHGSGSKKKK for T09.07 
andforM12.21 and LEGAPVPYPDPLEPRHHHHHHGSGSKKKK for 
the wild-type 10 Fn3. Five hundred milliliters of LB with the appropriate 



antibiotic (carbenicillin for T09.07, kanamycin for M12.21 and for 
wild-type 10 Fn3) were inoculated with 10 ml of an overnight culture 
and grown at 25°C, with shaking at 275 rpm, to OD 600 of 0.6-0.8. 
The culture was induced with 0.5 mM IPTG, shaken for another 16 
hr, and harvested by centrifugation at 3000 X g. The pelleted bacte- 
ria were dounce-homogenized and freeze-thawed in 50 ml of lysis 
buffer (50 mM phosphate [pH 8.0], 0.5 M NaCI, 5% glycerol, 10 mM 
CHAPS, 25 mM imidazole, 1 mM PMSF, 25 ng/ml DNase I [Roche], 
2 |ig/ml aprotonin [Roche], and 1 EDTA-free Protease Inhibitor 
Cocktail tablet [Roche]). After the addition of another 25 ng/ml of 
DNase and 1 mg/ml of lysozyme, the mixture was incubated at room 
temperature for 15 min, then centrifuged for 30 min at 27,000 X g. 
The supernatant was incubated for 30 min at 4°C with 1 ml of TA- 
LON™ Superflow™ Metal Affinity Resin (CLONTECH) prewashed 
with 4 X 10 ml of wash buffer (50 mM phosphate [pH 8.0], 0.5 M 
NaCI, 5% glycerol, 5 mM CHAPS, 25 mM imidazole). The His-tagged 
Fn3 domains were eluted with 12 X 0.5 ml of 50 mM HEPES (pH 
7.4), 150 mM NaCI, 5 mM CHAPS, 50 mM EDTA, 1 mM PMSF, 2 
l-ig/ml aprotonin, and 1 EDTA-free Protease Inhibitor Cocktail tablet, 
concentrated to 1 ml in CentriPrep-10 (Millipore), and exchanged 
into HBS using a NAP-10 column (Amersham Pharmacia). The final 
yield was 1.8 mg/l for T09.07, 3.6 mg/l for M12.21, and 18 mg/l for 
the wild-type 1 °Fn3. 

Protease digestion of 2 |j.g samples of the purified Fn3-based 
domain or of the control peptide (Ras#1 -189) fused to the C terminus 
of glutathione-S-transferase (GST-ras) was performed in 30 jjlI of 
HBS, 5 mM MgCI 2 , 1 mM DTT, with 20 ng of thermolysin (Sigma- 
Aldrich) in water baths maintained at 4°C, 25°C, 30°C, 37°C, 42°C, 
and 50°C. After a 15 min incubation, each reaction was stopped by 
the addition of 10 |xl of 4x SDS-PAGE buffer. Twenty microliters of 
each sample was separated on an SDS-PAGE gel (Novex; 16% 
Tricine for selected Fn3-like domains, and 1 0%-20% Tris-Glylcine 
for GST-ras). The resulting gels were destained, and the relative 
amount of protein in each band was quantified using a Chemilmag- 
er™ 4400 Low Light Imagine System and AlphaEase™ software 
(Alpha Innotech Corporation). The decrease in intensity of the band 
containing full-length Fn3 was used to monitor the extent of proteo- 
lytic degradation. 

The aggregation state of purified wild-type 1 °Fn3, of T09.07, and 
of M12.21 were compared using size-exclusion chromatography on 
a Waters™ 600E Multisolvent Delivery system, with a 7.8 mm X 15.0 
cm (5 |xm) QC PAK GFC 200 column (Toso Haas). Ten microliters 
of each sample at 0.1-1 .2 mg/ml was loaded onto the column and 
separated in HBS, at 1 ml/min, for 20 min. Protein absorption at 280 
nm was monitored, and the retention time of the major peak, which 
corresponded to the protein of the highest molecular weight species 
present in the sample, was compared to the retention time of molec- 
ular weight standards. The retention times of wild-type 10 Fn3, of the 
T09.07 variant, and of the M12.21 variant were found to be 6.33, 
6.29, and 6.26 min, respectively. These values fall between the reten- 
tion times of the molecular weight standards ribonuclease A (13.7 
kDa) and aprotonin (6.5 kDa), which are 6.01 and 9.08 min, respec- 
tively, suggesting that the molecular weights of the Fn3 domains 
studied are between 7 and 1 4 kDa. 

Construction of Antibody-Mimic Protein Arrays 
Each of the antibody mimics to be immobilized was prepared in the 
form of an mRNA-Fn3 fusion as described above, but with specific 
DNA-puromycin linkers to encode the identity of the Fn3 variant in 
the fusion. The four linkers used were TAGLN4 (5 ' - AAAAAAAAAAAC 
GCTATGAGTGTCTTGCC-pur), TAGLN8 (5 ' -AAAAAAAAAAACTTG G 
TGTGTGTCTTGCC-pur), TAGLN9 (5' -AAAAAAAAAAAC GCTCGCT A 
TGAGTGTCC-pur), and TAGLN16 (5'-AAAAAAAAAAAATGACGCTC 
TTGGTGTCC-pur) and were prepared with an automated DNA syn- 
thesizer (PE BioSystems Expedite 8909) using conventional phos- 
phoramidite chemistry and reagents from Glen Research. 

Fouramino-modified DNA capture probes that were complemen- 
tary to 16 nucleotides of the TAGLN linkers were synthesized: 
TAGCP4 (5'-CAAGACACTCATAGCG-3'), TAGCP8 (5'-CAAGACA 
CACACCAAG-3'), TAGCP9 (5'-ACACTCATAGCGAGCG-3'), and 
TAGCP16 (5'-ACACCAAGAGCGTCAT-3'). The 500 u,M capture oli- 
gonucleotides in 50 mM sodium carbonate (pH 9) were spotted 
onto an amine-reactive glass surface with a 3-axis robot (MicroGrid, 
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BioRobotics), resulting in 200 |jim features with 600 |jim pitch. Each 
subgrid contained 24 spots of a single capture probe. 

The mRNA-Fn3 fusion was prepared as described under Library 
Construction, except that the TAGLN series of puromycin-con- 
taining linkers was used. The fusion that contained the Fn3 variant 
T09.12, which binds TNF-ct, was ligated to TAG8LN; the fusion that 
contained L09.20, which binds leptin, was ligated to TAG4LN; and 
the fusion that contained wild-type 10 Fn3 was ligated to TAG16LN. 
The RNA portion of the fusion was removed using RNase A (NEB). 
A mixture of the resulting constructs, which contained the encoding 
DNA oligonucleotide-puromycin linkers and the Fn3 proteins, was 
applied to the microarray for 1 6 hr at room temperature. The hybrid- 
ized slide was washed in SSC and the slide was air-dried, then 
blocked with TBS, 0.05% Tween-20, 2% BSA. 

To test the immobilized antibody mimics for target binding, re- 
combinant target (TNF-a or leptin; PeproTech) was biotinylated with 
EZ-Link NHS-LC-LC (Pierce) and dialyzed against PBS. MALDI-TOF 
of the resulting conjugates confirmed that most of the target mole- 
cules had been modified with a single biotin molecule. 100 nM of a 
specific target in TBS, 0.05% Tween-20, 0.2% BSA was applied to 
the surface of the microarrayfor2 hr; next, the slide was washed with 
TBS, 0.05% Tween-20. Cy3-labeled anti-biotin monoclonal antibody 
(Sigma) was applied to the microarray for 1 hr, and then the array 
was washed with TBS, 0.05% Tween. The slide was allowed to air- 
dry. Fluorescence laser scanning was performed with a GSI Lumonics 
ScasnArray 5000 system using 10 |j.m pixel resolution (Figure 5). 
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